FEAT Add SALAD-Bench dataset loader by romanlutz · Pull Request #1425 · Azure/PyRIT

romanlutz · 2026-03-01T14:31:26Z

Add remote dataset loader for SALAD-Bench (walledai/SaladBench), a hierarchical safety benchmark with ~30k prompts organized into 6 domains, 16 tasks, and 65+ categories (ACL 2024).

Copilot

Pull request overview

Adds a new remote seed dataset loader for the SALAD-Bench HuggingFace dataset, making it available through PyRIT’s automatic SeedDatasetProvider discovery and documenting it in the dataset-loading guide.

Changes:

Added _SaladBenchDataset remote loader that fetches SALAD-Bench from HuggingFace and converts rows into SeedPrompts.
Registered the loader for auto-discovery via pyrit.datasets.seed_datasets.remote.__init__.
Added unit tests and updated the “Loading Built-in Datasets” notebook to show the new dataset name.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

File	Description
`pyrit/datasets/seed_datasets/remote/salad_bench_dataset.py`	New HuggingFace-backed loader that maps SALAD-Bench entries into `SeedDataset`/`SeedPrompt`.
`pyrit/datasets/seed_datasets/remote/__init__.py`	Imports/exports `_SaladBenchDataset` so it’s registered and discoverable.
`tests/unit/datasets/test_salad_bench_dataset.py`	Unit tests validating dataset fetching and config passthrough behavior.
`doc/code/datasets/1_loading_datasets.ipynb`	Documentation notebook updated to reflect the new dataset in the available list (but currently includes executed outputs/metadata).

Comments suppressed due to low confidence (1)

pyrit/datasets/seed_datasets/remote/salad_bench_dataset.py:74

The authors list formatting is inconsistent with other remote dataset loaders and is hard to read (and likely exceeds the repo’s 120-char line length). Please format the authors list across multiple lines (one author per line) like other dataset loaders for readability and consistent styling.

            dataset_name=self.hf_dataset_name,
            config=self.config,

doc/code/datasets/1_loading_datasets.ipynb

Add remote dataset loader for SALAD-Bench (walledai/SaladBench), a hierarchical safety benchmark with ~30k prompts organized into 6 domains, 16 tasks, and 65+ categories (ACL 2024). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

Comments suppressed due to low confidence (1)

doc/code/datasets/1_loading_datasets.ipynb:242

The notebook metadata was updated to a different local Python version. To avoid unnecessary diffs across environments, consider reverting/normalizing kernel metadata (or stripping it) in committed docs notebooks.

   "version": "3.13.5"

tests/unit/datasets/test_salad_bench_dataset.py

doc/code/datasets/1_loading_datasets.ipynb

The HF dataset identifier is now a class constant HF_DATASET_NAME instead of a constructor parameter, consistent with other loaders. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (1)

doc/code/datasets/1_loading_datasets.ipynb:200

This notebook output includes a DeprecationWarning with a user-specific temporary file path (C:\\Users\\...\\AppData\\Local\\Temp\\...). Please clear/sanitize this output (and ideally avoid emitting the warning in the example) so docs are reproducible and don't embed local filesystem paths.

      "C:\\Users\\romanlutz\\AppData\\Local\\Temp\\ipykernel_40808\\4021500943.py:10: DeprecationWarning: is_objective parameter is deprecated since 0.13.0. Use seed_type='objective' instead.\n",
      "  memory.get_seeds(harm_categories=[\"illegal\"], is_objective=True)\n"
     ]

pyrit/datasets/seed_datasets/remote/salad_bench_dataset.py

Wrapping in Jinja2 raw tags preserves original dataset text that may contain {{ }} or {% %} syntax. Also precomputes loop constants. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

Copilot · 2026-03-02T14:19:33Z

doc/code/datasets/1_loading_datasets.ipynb

    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
-      "\r\n",
-      "Loading datasets - this can take a few minutes:   0%|          | 0/49 [00:00<?, ?dataset/s]"
+      "\r",
+      "Loading datasets - this can take a few minutes:   0%|          | 0/50 [00:00<?, ?dataset/s]"
     ]


This notebook cell output includes run-specific progress bar output (dataset counts/timings) captured in stderr. Please clear outputs before committing to keep docs deterministic and reduce noisy diffs.

Copilot AI review requested due to automatic review settings March 1, 2026 14:31

Copilot started reviewing on behalf of romanlutz March 1, 2026 14:31 View session

romanlutz force-pushed the romanlutz/add-salad-bench-dataset branch from 95af585 to 99ab63b Compare March 1, 2026 14:32

Copilot AI reviewed Mar 1, 2026

View reviewed changes

doc/code/datasets/1_loading_datasets.ipynb Outdated Show resolved Hide resolved

doc/code/datasets/1_loading_datasets.ipynb Show resolved Hide resolved

romanlutz force-pushed the romanlutz/add-salad-bench-dataset branch from 99ab63b to 7db0e9c Compare March 1, 2026 14:53

Add SALAD-Bench dataset loader

bbc66cf

Add remote dataset loader for SALAD-Bench (walledai/SaladBench), a hierarchical safety benchmark with ~30k prompts organized into 6 domains, 16 tasks, and 65+ categories (ACL 2024). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

romanlutz force-pushed the romanlutz/add-salad-bench-dataset branch from 7db0e9c to bbc66cf Compare March 2, 2026 13:08

Copilot AI review requested due to automatic review settings March 2, 2026 13:08

Copilot started reviewing on behalf of romanlutz March 2, 2026 13:08 View session

Copilot AI reviewed Mar 2, 2026

View reviewed changes

tests/unit/datasets/test_salad_bench_dataset.py Outdated Show resolved Hide resolved

tests/unit/datasets/test_salad_bench_dataset.py Outdated Show resolved Hide resolved

doc/code/datasets/1_loading_datasets.ipynb Show resolved Hide resolved

romanlutz and others added 2 commits March 2, 2026 05:41

Remove dataset_name from constructor, hardcode as class constant

c91c587

The HF dataset identifier is now a class constant HF_DATASET_NAME instead of a constructor parameter, consistent with other loaders. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Use AsyncMock for _fetch_from_huggingface in tests

293133c

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings March 2, 2026 13:53

Copilot started reviewing on behalf of romanlutz March 2, 2026 13:53 View session

Copilot AI reviewed Mar 2, 2026

View reviewed changes

pyrit/datasets/seed_datasets/remote/salad_bench_dataset.py Show resolved Hide resolved

romanlutz and others added 2 commits March 2, 2026 06:04

Wrap prompt values in raw/endraw, precompute source_url and groups

1c0e75e

Wrapping in Jinja2 raw tags preserves original dataset text that may contain {{ }} or {% %} syntax. Also precomputes loop constants. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Fix ruff formatting

0f03dc2

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings March 2, 2026 14:11

Copilot started reviewing on behalf of romanlutz March 2, 2026 14:11 View session

Copilot AI reviewed Mar 2, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT Add SALAD-Bench dataset loader#1425

FEAT Add SALAD-Bench dataset loader#1425
romanlutz wants to merge 5 commits intoAzure:mainfrom
romanlutz:romanlutz/add-salad-bench-dataset

romanlutz commented Mar 1, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

romanlutz commented Mar 1, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants